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Abstract 

Team formation is a core problem in AI. Remarkably, 
little prior work has addressed the problem of mech¬ 
anism design for team formation, accounting for the 
need to elicit agents’ preferences over potential team¬ 
mates. Coalition formation in the related hedonic games 
has received much attention, but only from the perspec¬ 
tive of coalition stability, with little emphasis on the 
mechanism design objectives of true preference elici¬ 
tation, social welfare, and equity. We present the first 
formal mechanism design framework for team forma¬ 
tion, building on recent combinatorial matching market 
design literature. We exhibit four mechanisms for this 
problem, two novel, two simple extensions of known 
mechanisms from other domains. Two of these (one 
new, one known) have desirable theoretical properties. 
However, we use extensive experiments to show our 
second novel mechanism, despite having no theoretical 
guarantees, empirically achieves good incentive com¬ 
patibility, welfare, and fairness. 


Introduction 


Teamwork has been an important and often-studied area of 
artificial intelligence research. Typically, the focus is on co¬ 
ordinating agents to achieve a common goal. The comple¬ 
mentary problem of team formation considers how to form 
high-quality teams, whose agents have skills that are jointly 
well suited for a task ( jMarcolino, Jiang, and Tambe 2013| l. 
Notable team formation applications include formation of 
research teams, class project groups, groups of roommates, 
or disaster relief teams. 

Many prior team formation studies have assumed that 
agents are indifferent about which other agents they are 
teamed with, or have preferences known to the team forma¬ 
tion mechanism. Models dealing with known agent prefer¬ 
ences over teammates, termed hedonic games, have seen an 
extensive literature since being introduced by Aumann and 
|Dreze| ( T974| i. In a hedonic game, the mechanism is given 
a set of agents, each having public preferences over which 
others might be on its team; the mechanism must partition 
the agents into teams based on their preferences. 
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Past research on hedonic games has focused on the prob¬ 
lem of forming stable coalitions, from which no set of agents 
would prefer to defect. Since a core partition may not ex¬ 
ist in a hedonic game, even when preferences of players 
are additively separable ( |Banerjee, Konishi, and S6nmez| 
|2001|l, much research is focused on alternative notions of 


stability, or on highly restricted agent preferences (Bo- 
gomolnaia and Jackson 2002[ [Alcalde and Revilla 2004[ 


Cechlarova and Romero-Medina 2 001 |, or on the time com- 
plexity of testing core emptiness (|Ballester 2004| |Sung and 
IDimitrov 20T0| ). 


We consider team formation as a mechanism design prob¬ 
lem, where individuals have preferences over teammates, as 
in hedonic games. As in traditional mechanism design (and 
unlike hedonic games), we assume that these preferences 
are private and must be elicited in order to partition play¬ 
ers reasonably into teams. We draw a connection to another 
budding literature, that of combinatorial matching market 
design, which has course allocation as a typical applica¬ 
tion ( |Budish and Cantillon 2012) . 


An important concern in combinatorial matching, which 
we inherit, is the ex post fairness of allocations. Eor ex¬ 
ample, consider a simple randomized mechanism, random 
serial dictatorship, which has been proposed for course al¬ 
location and is readily adapted to team formation. In ran¬ 
dom serial dictatorship, agents are randomly ordered by the 
mechanism and then take turns, in order, selecting their en¬ 
tire teams from among the remaining agents. Random se¬ 
rial dictatorship is st rate gyp roof, meaning that it is a domi¬ 
nant strategy for any agent to report its true preferences over 
teams. Random serial dictatorship is also ex post Pareto ef¬ 
ficient, in that any allocation it returns cannot be modified 
to improve an agent’s welfare without reducing some other 
agent’s (assuming no indifferences). But this mechanism re¬ 
sults in a highly inequitable distribution of outcomes ex post. 

Budish and Cantillon (20121 proposed a more sophisti¬ 
cated alternative, approximate competitive equilibrium from 
equal incomes (A-CEEI), which is strategyproof-in-the- 
large (i.e., when the number of players becomes infinite), 
and provably approximately fair (|Budish and Cantillon 


|2012[ [Budish 20111 1. The work on combinatorial match¬ 
ing in turn follows earlier work on bipartite matching and 
school choice ([Roth and Peranson 1999| Abdulkadiroglu 
[and Sonmez 2003l [ ' 



















































Our contributions are as follows. 

1. We present the problem of mechanism design for team 
formation, focused on achieving (near-)incentive compat¬ 
ible preference reporting, high social welfare, and fair al¬ 
location. This problem is closely related to both combina¬ 
torial and bipartite matching market design, but is distinct 
from both in two senses: first, the matching is not bipar¬ 
tite (players match to other players), and therefore typical 
matching algorithms which only guarantee strategyproof- 
ness for one side are unsatisfactory; and second, mecha¬ 
nisms used in combinatorial exchanges to provide fairness 
guarantees are not directly applicable, as they rely on hav¬ 
ing a fixed set of items which are the subject of the match 
and which are not themselves strategic; 

2. we extend two well-known mechanisms (random serial 
dictatorship and Harvard Business School draft) used for 
combinatorial matching to our setting; 

3. we propose two novel mechanisms for our setting (A- 
CEEI for team formation, or A-CEEI-TE, and one-player- 
one-pick draft, or OPOP); 

4. we prove that A-CEEI-TE is approximately fair and 
strategyproof-in-the-large; 

5. we offer empirical analysis of all mechanisms, which 
shows that our second mechanism, OPOP, outperforms 
others on most metrics, and has better incentive proper¬ 
ties than A-CEEI-TE. 

An important and surprising hnding of our investigation is 
that the simple draft mechanism we propose empirically 
outperforms the more complex A-CEEI-TE alternative by 
a large margin in fairness and incentive compatibility, even 
while A-CEEI-TE has more compelling theoretical guaran¬ 
tees. 


Mechanism Design Problem 

Our point of departure is the formalism of hedonic games. 
We dehne a hedonic game as a tuple {N, y), where N is 
the set of players, and is a vector containing each player’s 
preference order over sets of other players that it could be 
teamed with. The task is to partition the players in N into a 
coalition structure, where each player is in exactly one coali¬ 
tion. 

We assume that player preferences are additively separa¬ 
ble ( |Aziz, Brandt, and Seedig 201 l| l, which means that there 
exists an assignment of values Ui{j) for all players i and 
their potential teammates j, so that i’s total utility of a subset 
of others S is (which induces a corresponding 

preference ordering over subsets of possible teammates). In 
addition, we assume that Ui{j) > 0 for all i,j. 

These assumptions are useful for two reasons. Eirst, in 
many data sets that record preferences of individuals over 
others, the preferences are entered as non-negative values 
for individuals, as in rank order lists or Likert ratings of indi¬ 
viduals. Additive separable preferences are the most natural 
way to induce preferences over groups from such data. Sec¬ 
ond, many prior studies in team formation and the related 
domain of course allocation have assumed that agents have 


non-negative, additive separable preferences, as in the B- 
preferences of ( |Cechlarova and Romero-Medina 201)1) and 
in the bidding points auction. 

Most prior work on hedonic games focuses on coalition 
stability. Our goal is distinct: We take as input player pref¬ 
erences over teams (that is, over others that they could be 
teamed with), which we assume to be additive with non¬ 
negative values, and output a partition of the players into 
teams. We assume that it is subsequently difficult for play¬ 
ers to alter team membership. Our primary challenge, there¬ 
fore, is to encourage players to report their preferences hon¬ 
estly, and form teams that are fair and yield good teammate 
matchings; all three notions shall be made precise presently. 
Note that in this construction we assume that no money can 
change hands (unlike the work by Li et al. ( |2004| l). 

Observe that in our model, all players always prefer to be 
put on a single team (since values for all potential teammates 
are positive). In reality, many team formation problems have 
hard constraints on team sizes (or, equivalently, on the num¬ 
ber of teams), particularly when multiple tasks need to be 
accomplished. Eor example, project teams usually have an 
upper bound on size. We capture this by introducing team 
size constraints; formally, the size of any team must be in 
the interval [k,k], with k > 1, k < |A^|, and k < k. Eor 
example, if a classroom with 25 students must be divided 
into 6 approximately equal-size teams, we could have k = A 
and k = 5. We assume throughout that the specific values of 
k and k admit a feasible allocation. (This is not always the 
case; see supplemental material for details.) 

In contrast with a typical approach in mechanism design, 
which seeks to maximize a single objective such as social 
welfare or designer revenue, subject to a constraint set, we 
take an approach from the matching market design litera¬ 
ture, and seek a collection of desirable properties (see, e.g., 
Budish (2012|l). Specihcally, we consider three properties: 


incentive compatibility, social welfare, and fairness. Given 
the fact that all three cannot be achieved simultaneously in 
our setting, we will analyze the extent to which each can be 
achieved through specific mechanisms. 


Incentive Compatibility Incentive compatibility holds if 
there is no incentive for an agent to misreport its preferences. 
We consider two forms of incentive compatibility: strate- 
gyproofness, which means that it is a dominant strategy for 
any agent to report its true preferences, and ex post equilib¬ 
rium, which means that it is a Nash equilibrium for all agents 
to report their true preferences. The former will be consid¬ 
ered in theoretical analysis, while the latter will be the focus 
of empirical incentive assessment. In particular, our theory 
will focus on strategyproofness-in-the-large ( |Budish 201 l) l, 
defined as follows. Consider a market where each agent has 
been replaced with a measure-one continuum of replicas of 
itself, such that each individual agent has zero measure and 
all agents are price takers. A mechanism is strategyproof-in- 
the-large if, in such a market, it is a dominant strategy for 
each agent to reveal its true preferences. An example of a 
mechanism that is not strategyproof-in-the-large is the Har¬ 
vard Business School draft considered below, in which an 



















agent may benefit from misreporting its preferences, regard¬ 
less of its own measure relative to the market size (|Budish| 


and Cantillon 20121. In empirical analysis, in contrast, we 


determine a lower bound on the regret of truthful reporting, 
which is the most any agent can gain ex post by misreporting 
preferences when all others are truthful. 


either. Some other agent j must then be on a team with the 
two most-preferred agents of the player i. By construction, 
player f is on a team of value 3 or less, while the team of 
agent j has value 12 to agent i, and value 4 to agent i with its 
more valuable player (player A) removed. Therefore, envy 
cannot be bounded by a single teammate for all agents. □ 


Social Welfare As in traditional mechanism design, we 
consider social welfare as one of our primary design cri¬ 
teria. Social welfare is just the sum of player utilities 
achieved by a specific partition of players into teams. For¬ 
mally, if Q is a partition of players, social welfare is de¬ 
fined as SW{Q) = 

we consider the weaker notion of ex post Pareto optimality 
when discussing alternative mechanisms and their theoret¬ 
ical properties. A partition of players Q is ex post Pareto 
optimal if no other partition strictly improves some agent’s 
utility without lowering the utility of any other agent. 


Fairness The measure of fairness we consider is envy- 
freeness. An allocation is envy-free if each agent weakly 
prefers its own allocation to that of any other agent. An ap¬ 
proximate notion of envy-freeness that we adopt from Bud- 
ish ( 201 l| l is envy bounded by a single teammate, in which 
any allocation an agent prefers to their own ceases to be pre¬ 
ferred through removal of a single teammate from it|^ The 
following negative result makes apparent the considerable 
challenge associated with the design problem we pose. 


Team Formation Mechanisms 

We describe four mechanisms for team formation; two are 
straightforward applications of known mechanisms, while 
two are novel. 


Random Serial Dictatorship 

Random serial dictatorship (RSD) has previously been pro 
posed in association with school choice problems ([Abdulka- 


diroglu and Sonmez 2003i . In RSD, players are randomly 
ordered, and each player chosen in this order selects his team 
(with players thereby chosen dropping out from the order). 
The process is repeated until all players are teamed up. 


Proposition 2. Random serial dictatorship is strategyproof 
and ex post Pareto efficient as long as players choosing later 
cannot choose a larger team. 

While RSD is ex post Pareto efficient, this turns out to be a 
weak guarantee, and does not in general imply social welfare 
maximization, something that becomes immediately appar¬ 
ent in the experiments below. Envy-freeness is, of course, 
out of the question due to Proposition[2 


Proposition 1. There may not exist a partition of players 
that bounds envy by a single teammate. 

Proof Consider a team formation problem with 6 agents, 
{A, B, C, D, E, F}, fc = 3, fc = 3, so that two equal-size 
teams must be formed. The agents’ additive separable pref¬ 
erences are encoded in Table 



A 

B 

C 

D 

F 

F 

A 

X 

0 

1 

2 

4 

8 

B 

8 

X 

4 

2 

1 

0 

C 

8 

0 

X 

4 

2 

1 

D 

8 

1 

0 

X 

4 

2 

E 

8 

2 

1 

0 

X 

4 

F 

8 

4 

2 

1 

0 

X 


Table 1; Each row i encodes the additive separable value for 
agent i of each other agent. 

No partition of these agents into two teams of size 3 gives 
every agent envy bounded by a single teammate. To see this, 
consider that each agent other than A has a bliss point on a 
team with A and one other agent, where the second agent is 
C for agent B, D for agent C, and so on until “wrapping 
around” with B for agent F. Three of the agents will not be 
on a team with agent A, and at least one of these agents, say 
agent i, will not be on a team with its second-favorite agent 


Harvard Business School (HBS) Draft 

Players are randomly ordered, with the first T assigned as 
captains. We then iterate over captains, first in the random 
order, then in reverse, alternating. The current team captain 
selects its most-preferred remaining player to join its team, 
based on its reported preferences. 

Proposition 3. BBS draft is not strategyproof or ex post 
Pareto efficient. 

One-Player-One-Pick (OPOP) Draft 

Players are randomly ordered. Given the list of team sizes, 
the first T players are assigned to be captains of the respec¬ 
tive teams. Then iterate over the complete player list. If the 
next player is a team captain, it selects its favorite unas¬ 
signed agent to join its team. If the next player is unassigned, 
it will be assigned to join its favorite incomplete team (as 
defined below), and if the team still has space, this player 
chooses its favorite unassigned agent to join them. We de¬ 
fine a “favorite” incomplete team for an agent as follows. 
Let S be an incomplete team with vs vacancies. Let the 
mean value to player i of the unassigned players be fXi. We 
then assign the following utility of an incomplete team S to 
agent i: + (vs - 1)Mz- 

Proposition 4. The One-Player-One-Pick draft is not strat¬ 
egyproof or ex post Pareto efficient. 


*In the supplemental material we discuss another measure of 
fairness. 


^The proofs of this and other results are in the supplemental 
material. 


















Competitive Equilibrium from Equal Incomes 

We now propose a more complex mechanism, based on 
the Competitive Equilibrium from Equal Incomes (CEEI), 
which is explicitly designed to achieve allocations that are 
more ex post fair than the alternative mechanisms. 

We begin by defining CEEI, previously introduced by 
Varian \\91A\ . Given a set of agents N, a set of goods C, and 
agent preferences over bundles of goods y, a CEEI mecha- 
nism finds a budget 6 G M+ and price vector p* G , such 
that if each agent is allocated its favorite bundle of goods that 
costs no more than b, then each good in C is allocated to ex¬ 
actly one agent in N, or divided in fractions summing to 1 
among the N. In combinatorial allocation problems, such as 
course allocation, goods (seats in a class) are not divisible, 
and certain bundles of goods (class schedules) are not al¬ 
lowed to be assigned to an agent. As a result, an exact mar¬ 
ket clearing tuple {b,p*) may not exist. To deal with this 
difficulty, CEEI was relaxed by Budish ( |201 l| l to an approx¬ 
imate version, termed A-CEEI. A-CEEI works by assigning 
nearly equal budgets to all agents, then searching for an ap¬ 
proximately market clearing price vector and returning the 
allocation induced by those prices. The result may not clear 
the market exactly, but there is an upper bound on the worst- 
case market clearing error. The resulting allocation satisfies 
an approximate form of envy-freeness ( |Budish 201 1| |. 

Both CEEI and A-CEEI take advantage of the dichotomy 
between agents and items which agents demand. This makes 
our setting distinct; agents’ demand in team formation is 
over subsets of other agents. A technical consequence is 
that this gives rise to a hard constraint for CEEI that if an 
agent i is paired with agent j, than j must also be paired 
(assigned to) agent i; any relaxation of this constraint fails 
to yield a partition on the agents and consequently does not 
result in an admissible mechanism. We therefore design an 
approximation of CEEI, termed A-CEEI-TF, that accounts 
for the specific peculiarities of our setting. Conceptually, the 
A-CEEI-TE mechanism works by alternating between two 
steps. Eirst, it searches in price space for approximate re¬ 
laxed market-clearing prices. Second, it assigns a randomly 
selected unmatched agent to form a team with its favorite 
bundle of free agents that is affordable, based on current 
prices. The result is a mechanism that is strategyproof-in- 
the-large, and more fair than random serial dictatorship. 

A key part of A-CEEI-TF is a price update function, 
which reflects the constraints of the team formation prob¬ 
lem. We use a tatonnement-like price update function / in 
an auxiliary price space V = [—l,l-|-5]l^ I, where N' is the 
set of agents remaining (unassigned) at an iteration of the al¬ 
gorithm, and b is the supremum of allowable agent budgets. 
We make two requirements of a price update function, one 
ensuring that the iterative updates are well-defined, another 
to ensure that fixed points of the process are actual solutions. 

Definition 1. A price update function / is admissible if (a) 
its fixed points correspond to (relaxed) market clearing, and 
(b)V is closed under f. 


Algorithm 1 A-CEEI-TF Algorithm Outline. 

Require: (N,';^,k,k) 

1: Randomly assign approximately equal budgets bi to the 
agents, bi G [1,6], & < l-f 1/|A^|. 

2; Randomly order the agents. 

3: Search for a price vector p in price space V = [0, b]^^ I 
that approximately clears the (relaxed) market among 
the N' remaining agents, given agent budgets b. 

4: Take the next unmatched agent in the random order, and 
assign it to join its favorite bundle of other free agents 
that it can afford at the current prices, and that leaves 
a feasible subproblem—i.e., feasible {N',k,k). If the 
agent cannot afford any remaining bundle of legal size 
that leaves a feasible subproblem, the agent is assigned 
its favorite remaining bundle of legal size that leaves a 
feasible subproblem. 

5: Repeat steps 3 and 4 until each agent is on a team. 


We now define a candidate price update function, fTF- 


fTF{P)j = t{p)j + 


(l + e- {e/b)t(P)j)Dj -Uj 
\N'\ 


( 1 ) 


where Dj is the number of agents that demand j but whom 
j does not demand, Uj = 1 if and only if no other agent 
demands j, and 0 otherwise, b is the supremum of allowable 
agent budgets, and tf) is a truncation function, which takes 
a price vector p and truncates it to the [0, b] interval. 

Proposition 5. pTFi') admissible. 

While admissibility of /tf(') alone does not guarantee 
convergence of the iterative process, it does guarantee that 
if convergence happens, we have a solution. The following 
proposition characterizes some of the properties such solu¬ 
tions possess. 

Proposition 6. A-CEEI-TF is strategyproof-in-the-large. In 
addition, if A-CEEI-TF yields exact market clearing and in¬ 
duces the same allocation at each stage of price search, it 
yields envy bounded by a single teammate. 


Proof We sketch a proof of strategyproofness-in-the-large. 

If a team formation problem is modified such that each 
agent is replaced with a measure-one set of copies of itself, 
each copy being measure zero, we arrive at what is called 
a continuum economy. If we run A-CEEI-TF in the con¬ 
tinuum economy, any individual agent, being zero-measure, 
has no influence on the approximate equilibrium price vector 
arrived at by update function any iteration of the 

A-CEEI-TF mechanism. Therefore, the only effect the agent 
can have on the outcome is that, if the agent is randomly 
selected to choose its favorite affordable team of available 
agents that leaves a feasible subproblem, the agent’s re¬ 
ported preferences determine which team the agent is as¬ 
signed. Thus, it is a dominant strategy for the agent to report 
its true preferences, so that in this case the agent will be as¬ 
signed its most-preferred allowable team. □ 












Experiments 

Although RSD and A-CEEI-TF possess desirable theoreti¬ 
cal properties, these results are loose, and the only approx¬ 
imate fairness guarantee, shown for A-CEEI-TE, requires 
strong assumptions on the environment. We now assess all of 
the proposed mechanisms empirically through simulations 
based both on randomly generated classes of preferences, as 
well as real-world data. Our empirical results turn out to be 
both one-sided (if one is interested in achieving all three de¬ 
sired properties) and surprising; OPOP, a mechanism with 
no provable theoretical guarantees, tends to outperform oth¬ 
ers in fairness, and to perform nearly as well as the best other 
mechanism in truthfulness and social welfare. 

Data Sets: We use both randomly generated data and data 
from prior studies on preferences of human subjects over 
each other; 


To measure agent preference similarity in a data set, we let 
C equal the mean cosine similarity among all pairs of distinct 
agents in the data set. Each agent assigns itself a value of 0 or 
undefined, so we take the cosine similarity between agents i 
and 2 only over their values for agents in \ 

~ '^i\ 

^ —z-t-l II II 

{\N\^-\N\)/2 

In Table 1^ we present the mean cosine similarity for each 
data set we discuss in this paper. Higher cosine similarities 
indicate greater agreement among agents about the relative 
values of other agents. We also show the number of agents 
in each data set. 


• Random-similar (R-sim) ||Othman, Sandholm, and| 
Budish 20i0| : Each agent i, i S jl, 2,..., |W|}, is as- 
signed the public value i. A private error term is added to 
the public value of i to derive the value of i to each other 
agent j, drawn independently from a normal distribution 
with zero mean and standard deviation |W|/5. The private 
error term is redrawn until the sum of the private error 
term and public value is non-negative. Then the value of i 
to j is the sum of i and the private error term. 

• Random-scattered (R-sca): In this data set class, the 
value of a player i is generated independently by each 
other player j. To determine the value of other players to 
player j, a total value of 100 is divided at random among 
the other players as follows. Uniformly random numbers 
G [0,100] are taken, to divide the region into |A^| — 1 re¬ 
gions. The random draws for agent j are sorted, produc¬ 
ing |iV| — 1 values for the other agents, as the differences 
between consecutive draws in sorted order. 


• Newfrat: This data set comes from a widely cited study 
by Newcomb, in which 17 students at the University of 
Michigan in 1956 ranked each other in terms of friendship 
ties. We use the data set from the final week, NEWC15 
( Newcomb 1958| . We let fc = 4, A: = 5. 


• Freeman: The data are from a study of email messages 
sent among 32 researchers in 1978. We use the third ma¬ 
trix of values from the study. The data show how many 
emails each researcher sent to each other during the study, 
whic h we use as a proxy for the stre ngth of directed social 
links (Freeman and Freeman 19791. We let k = 5, k = 6. 


For the randomly generated data sets, we set | A^j = 20 and 
k = k = 5, and our results are averaged over 20 generated 
preference rankings for all players. 

The four classes of data set we analyze differ most 
saliently in their number of agents, \N\, and in the degree 
of similarity among the player preferences. For example. 
Random-similar agents largely agree on which other agents 
are most valuable, while Random-scattered agents have lit¬ 
tle agreement. Differences in degree of preference similarity 
lead to marked differences in the performance outcomes of 
the various mechanisms. 



C 

\N\ 

k 

k 

Random-similar 20 

0.914 

20 

5 

5 

Random-scattered 20 

0.499 

20 

5 

5 

Newfrat 

0.877 

17 

4 

5 

Freeman 

0.551 

32 

5 

6 


Table 2; Mean cosine similarity over all pairs of distinct 
agents; number of agents; minimum team size; and maxi¬ 
mum team size. For random data set classes, C as shown is 
the mean over 20 randomly generated instances of the class. 


Empirical Analysis of Incentive Compatibility: To study 
the incentive compatibility of the mechanisms, we used a 
protocol similar to that used by Vorobeychik and Engel 
for estimating the regret of a strategy profile ([Vorobeychik 


[and Engel 2011 1 . We ran each mechanism on 8 versions of 
each data set, with different random orders over the play¬ 
ers, which we held in common across data sets. We generate 
deviations from truthful reporting for agent j one at a time 
until 25 unique deviations have been produced. To produce a 
deviation from an agent’s truthful values for other agents, we 
first randomly select a number of pairs of values to swap ac¬ 
cording to a Poisson distribution with A = 1, with 1 added. 
For each pair of values to swap, we first select the rank of 
one of them, with lower (better) rank s more likely. 

The results are shown in Table (30 

RSD is not shown, since it is provably strategyproof, 
but, remarkably, A-CEEI-TF empirically produces higher 
(worse) regret of truthful reporting than HBS or OPOP, 
even though A-CEEI-TF is strategyproof-in-the-large, and 
the others are not. Both HBS and OPOP appear to offer play¬ 
ers only small incentives to lie, with HBS slightly better. 


Social Welfare: To facilitate comparison, we normalize the 
total utility of all teammates for each agent to 1, so that so¬ 
cial welfare (already normalized for the number of players) 
falls in the [0,1] interval. For comparison, we also include 


^We only report results for the two random data sets and 
Newfrat, as it was not feasible to rigorously analyze regret for the 
far larger Freeman data set. (However, the Freeman data set is sim¬ 
ilar to Random-scattered; see supplemental material for details.) 























R- 

sim. 

R 

-sea. 

Newfrat 

HBS 

0.02 

± 

0.02 

0.04 

± 

0.02 

0.03 

± 

0.02 

OPOP 

0.07 

± 

0.02 

0.10 

± 

0.05 

0.06 

± 

0.02 

A-CEEI-TP 

0.19 

± 

0.04 

0.29 

± 

0.07 

0.19 

± 

0.03 

Max-welfare 

0.20 

± 

0.03 

0.29 

± 

0.07 

0.22 

± 

0.03 


Table 3; Mean maximum observed regret of truthful report¬ 
ing, with 95% confidence intervals. 

optimal social welfare for both Random data sets, as well as 
Newfrat0 

The only related theoretical result is that RSD is ex post 
Pareto optimal; the other three mechanisms do not even pos¬ 
sess this guarantee. This makes our results, shown in Ta¬ 
bles 1^ and 1^ remarkable: on Random-similar and Newfrat 
data sets (both with preferences relatively similar across 
players), there is little difference in welfare generated by the 
different mechanisms, but on Random-scattered and Free¬ 
man data sets, OPOP statistically significantly outperforms 
the others. 



R 

-sim. 

R 

-sea. 

RSD 

0.22 

± 

0.004 

0.25 

± 

0.01 

A-CEEI-TP 

0.22 

± 

0.004 

0.25 

± 

0.01 

HBS 

0.22 

± 

0.004 

0.25 

± 

0.02 

OPOP 

0.22 

± 

0.003 

0.27 

± 

0.01 

Max-welfare 

0.25 

± 

0.001 

0.35 

± 

0.01 


Table 4: Mean social welfare for the two Random data sets, 
with 95% confidence intervals. 



Newfrat 

Ereeman 

RSD 

0.23 

± 

0.01 

0.20 

± 

0.01 

A-CEEI-TP 

0.23 

± 

0.01 

0.19 

± 

0.01 

HBS 

0.22 

± 

0.05 

0.20 

± 

0.01 

OPOP 

0.22 

± 

0.05 

0.24 

± 

0.02 

Max-welfare 

0.27 

± 

0.00 


- 



Table 5: Mean social welfare for the Newfrat and Freeman 
data sets, with 95% confidence intervals. 

Fairness: Fairness of an allocation (in our case, a parti¬ 
tion of players) can be conceptually described as the relative 
utility of best- and worst-off agents. Formally, we measure 
fairness in the experiments as the fraction of agents whose 
envy is bounded by a single teammate (as defined above). 

Our fairness results, shown in Tables and are unam¬ 
biguous: RSD is always worse, typically by a significant 
margin, then the other mechanisms. This is intuitive, and is 
precisely the reason why alternatives to RSD are commonly 
considered. What is far more surprising is that A-CEEI-TF, 
in spite of some theoretical promise on the fairness front, and 
in spite of being explicitly designed for fairness, is in all but 
one case the second worst. While HBS and OPOP are com¬ 
parable on the high-similarity data sets (Random-similar and 

"’ll was infeasible to compute this for the Freeman data set due 
to its size. 


Newfrat), it dominates all others on the dissimilar data sets 
(Random-scattered and Ereeman). 



R- 

sim. 

R 

-sea. 

RSD 

0.43 

± 

0.03 

0.59 

± 

0.05 

A-CEEI-TP 

0.66 

± 

0.05 

0.62 

± 

0.05 

HBS 

0.71 

± 

0.04 

0.61 

± 

0.06 

OPOP 

0.70 

± 

0.06 

0.79 

± 

0.04 


Table 6: Mean fraction of agents with envy bounded by a 
single teammate for the two Random data sets, with 95% 
confidence intervals. 



Newfrat 

Ereeman 

RSD 

0.36 ±0.02 

0.43 ± 0.05 

A-CEEI-TP 

0.57 ±0.03 

0.55 ±0.05 

HBS 

0.67 ±0.05 

0.64 ±0.04 

OPOP 

0.68 ±0.07 

0.78 ±0.05 


Table 7: Mean fraction of agents with envy bounded by a 
single teammate for the Newfrat and Ereeman data sets, with 
95% confidence intervals. 

Next, we consider informal perspectives on the fairness of 
the various mechanisms. Eor mechanisms that use a random 
serial order over players, we can study typical outcomes for 
a player given its serial index. If players with lower (better) 
serial indexes receive drastically better outcomes than agents 
with higher (worse) indexes, such a mechanism is not very 
fair. In Eigure [T] (left), we plot a smoothed version of the 
mean fraction of total utility achieved by agents at each ran¬ 
dom serial index from 0 to 19, for the mechanisms RSD, 
HBS draft, OPOP draft, and A-CEEI-TE Results are based 
on 20 instances of Random-scattered preferences, held in 
common across the mechanisms, with different serial orders 
over the players. Prom this Eigure, it is apparent that ran¬ 
dom serial dictatorship gives far better outcomes to the best- 
ranked agents than to any others. Surprisingly, A-CEEI-TP 
follows a similar pattern in this case, although we did ob¬ 
serve that for some other game types (not shown), A-CEEI- 
TP’s curve gives better outcomes to low-ranked agents than 
RSD. In the HBS draft, a “shelf” of high utility for the sev¬ 
eral best-ranked agents is typical, as all of the team captains 
receive similarly high utility, with a steep drop-off in utility 
for non-captain agents. In the OPOP draft, in contrast to all 
others, the utility curve is far more flat across random se¬ 
rial indexes: even the agents with high (bad) serial indexes 
achieve moderately good outcomes for themselves. 

Some mechanisms for team formation tend to give better 
outcomes to an agent that is “popular,” having a high mean 
value to the other agents. Eor example, random serial dicta¬ 
torship biases outcomes in favor of popular players, because 
even if a popular player is not a team captain, it is likely 
that this player will be selected by some team captain along 
with other desirable players. An unpopular player, however, 
will likely be left until near the final iteration of RSD, to 
be selected along with other unpopular players. Therefore, 














5 10 15 20 

Random Serial Index 



Mean Rank by Others 


Figure 1: Left: Mean fraction of total utility earned versus 
the random serial index of the agent. Fraction of total utility 
is between 0 (worst) and 1 (best). A cubic smoothing spline 
is applied. Right: Mean rank of an agent’s teammates, versus 
mean rank of the agent by other agents. Possible ranks range 
from 1 (best) to 20 (worst). Each point represents a sin¬ 
gle agent’s mean outcome. Best-fit lines use ordinary least 
squares. 


we might expect RSD to yield better outcomes to popular 
players, especially when agents’ preferences are highly sim¬ 
ilar. To quantify this intuition, we plot in Figure (right) the 
mean rank of an agent’s teammates according to the agent’s 
preferences, versus the agent’s mean rank assigned by the 
other agents. Each point in the scatter plot represents a sin¬ 
gle agent’s mean outcomes across 20 instances of Random- 
similar preferences, held in common across the mechanisms. 
We find best-fit lines via OLS regression, for each of RSD, 
HBS draft, and OPOP draft. The results indicate that, as ex¬ 
pected, RSD offers better outcomes to popular agents than 
to unpopular ones, with a distinctly positive trend line. The 
HBS draft and OPOP draft appear less biased for or against 
popular agents, with OPOP showing slightly lower correla¬ 
tion than HBS between an agent’s popularity and the mean 
value of its assigned team. 

Conclusion 

We considered team formation as a mechanism design prob¬ 
lem, in which the mechanism elicits agents’ preferences 
over potential teammates in order to partition the agents into 
teams. The teams produced should have high social welfare 
and fairness, in the sense that few agents should prefer to 
switch teams with others. We proposed two novel mecha¬ 
nisms for this problem: a version of approximate compet¬ 
itive equilibrium for equal incomes (A-CEEI-TE), and the 
one-player-one-pick draft (OPOP). We showed theoretically 
that A-CEEI-TE is strategyproof-in-the-large and approxi¬ 
mates envy-freeness. OPOP lacks these theoretical guaran¬ 
tees but empirically outperformed A-CEEI-TE in truthful¬ 
ness and fairness, as well as in social welfare for data sets 
with sufficiently dissimilar agent preferences. In addition, 
OPOP surpassed other mechanisms tested, including ran¬ 
dom serial dictatorship and the HBS draft, in social welfare 
and fairness. The HBS draft, however, produced slightly bet¬ 
ter truthfulness that the OPOP draft. Given the relative sim¬ 
plicity of implementing OPOP, this mechanism emerges as 
a strong candidate for team formation settings. 
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Appendix 

Team Size Constraints that Admit a Feasible 
Partition 

Some tuples {N, k, k) are not feasible, meaning that it is not possi¬ 
ble to divide \N\ players into teams with sizes in [k, k]. Recall that 
we require by definition 1 < k < k < |A^|. For a minimal exam¬ 
ple, it is not possible to divide 3 players into teams with k — 2, 
k = 2. 

Observation 1. A tuple {N,k, k) is feasible if and only if: k di¬ 
vides I A|, k divides | A'l, or (| A"! \fc) > (| A| \fe), where \ signifies 
integer division. 

Computing A Social Welfare-Maximizing Partition 

In order to find a social welfare-maximizing partition, we formulate 
the problem as an MIP and solve it using CPLEX. We are given 
a feasible team formation problem as (A, >~,k, k), where either 
k = k,oxk + l = k and neither k nor k divides | A|. Let T equal 
the number of teams that results when as many teams of size k as 
possible are formed, the rest being of size k. 

We introduce a matrix x G where each row corre¬ 

sponds to one team, and the 1 values in the row indicate which 
agents are on that team. We also introduce a dummy variable, 
S G {0,where S'tij = 1 if agents i and j are both on 
team t, otherwise 0. Sij G {0,1} 1^11^1 is 1 if and only if agents 
i and j are on the same team, and is the sum over T values of Suj 
for i and j. 

The MIP for maximize social welfare is then: 

|JV| \N\ 

max . ^ ^ ' SijlyUij -t- uji'j 

i=i j=i+i 

subject to: 

X G 

SG {0,ir"l^l"l^l 

2Stij < Xu -\-Xtj ■■ 

Stij > Xu -\-xtj - I ■■ Vf, i, j 

T 

:Vi,i 

t = l 
|iV| 

k < xtj < k ■. Vf 


T 

We implemented a second version of the maximize social wel¬ 
fare MIP, which appears to run markedly faster. In this version, a 
matrix x G {0, has a 1 in row i for each agent on the 

team of agent i. We require that Xij = Xji so that demands are re¬ 
ciprocal. We also require that each agent demand itself, so xu = 1. 
Each team size must be in [k, fc], so the sum of each row of x must 
be in this range. Finally, we require that any two rows in x either 
not have a 1 in any of the same columns, or must be identical; this 
means that if some agent i appears on two teams (i.e., in two rows), 
those teams must contain exactly the same agents. 

|iV| |Ar| 

max : EE XijUij 

i=l i=l 

subject to: 

Xu = 1 ■. Vi 
Xij - Xji = 0 : Vi, j 

|iV| 

k < Xij < k ■.'ii 

1=1 

Xij -\-Xiij Xij, - Xi,j, < 2 : Vi,i' 7^ ijj' 7^ j 


Proof of Proposition!^ 

Consider a team formation problem with 6 agents, 
{A, B,C, D, E, F}, k = 3, k — 3, so that two equal-size 
teams must be formed. The agents’ additive separable preferences 
are encoded in Table 



A 

B 

C 

D 

E 

E 

A 

X 

0 

1 

2 

4 

8 

B 

8 

X 

4 

2 

1 

0 

C 

8 

0 

X 

4 

2 

1 

D 

8 

1 

0 

X 

4 

2 

E 

8 

2 

1 

0 

X 

4 

F 

8 

4 

2 

1 

0 

X 


Table 8: Each row i encodes the additive separable value for 
agent i of each other agent. 

No partition of these agents into two teams of size 3 gives every 
agent envy bounded by a single teammate. To see this, consider 
that each agent other than A has a bliss point on a team with A 
and one other agent, where the second agent is C for agent B, D 
for agent C, and so on until “wrapping around” with B for agent 
F. Three of the agents will not be on a team with agent A, and at 
least one of these agents, say agent i, will not be on a team with 
its second-favorite agent either. Some other agent j must then be 
on a team with the two most-preferred agents of the player i. By 
construction, player i is on a team of value 3 or less, while the team 
of agent j has value 12 to agent i, and value 4 to agent i with its 
more valuable player (player A) removed. Therefore, envy cannot 
be bounded by a single teammate for all agents. □ 




Maximin Share Guarantee The maximin share guarantee is 
a concept from multi-unit assignment that can be applied to he¬ 
donic games; in multi-unit assignment, “bundles” of “items” are 
allocated to agents instead of “teams.” A maximin share for an 
agent is the agent’s least-preferred bundle in a maximin split for 
the agent. A maximin split for agent i is a partition of all items into 
bundles such that each agent can receive one bundle, where the par¬ 
tition maximizes the utility to i of the least-valuable bundle. Note 
that envy-freeness in a split for an agent implies proportionality, 
and proportionality implies maximin shares dProcaccia and Wangl 
|M4). 

We introduce a modified version of the maximin share guaran¬ 
tee in our setting, the maximin share guarantee for team forma¬ 
tion. A partition in a team formation problem provides maximin 
shares for team formation, if each agent weakly prefers its team to 
its maximin share. Given {N, k), consider all partitions of 

the agents N with team sizes in [k, k]. A maximin split for agent 
i is any such partition that maximizes the value for i of the least- 
preferred team containing i, which would result from swapping i 
with some agent j, where j may equal i. A maximin share is the 
least-preferred team in a maximin split. This definition of the max¬ 
imin share guarantee preserves a useful property of the guarantee 
for multi-unit assignment, which is that an agent may be assigned 
one of I A^l teams from its maximin split (up to the loss of the other 
agent in the swap from a team, in the case of team formation). 

The maximin share guarantee for team formation, in the case 
of mechanism design for team formation problems (MDTFs) with 
non-negative values, is trivially satisfiable for fc = | Wl: If the grand 
coalition is assigned, every agent’s payoff is maximized, so envy- 
freeness is achieved. The maximin share guarantee for team for¬ 
mation, in the case of MDTFs with non-negative values, is also 
trivially satisfiable for k = 2, where \N\ is even. In the maximin 
split for an agent with k = 2, all agents are grouped in pairs (as¬ 
suming \N\ is even), and the agent’s maximin share is the pair with 
its least-favorite other agent. Thus, any grouping of the agents into 
pairs satisfies the maximin share guarantee. 

Proposition 7. For some MDTFs with non-negative values, k > 3, 
and k < |A^|, the maximin share guarantee for team formation is 
not satisfiable. As a result, proportionality and envy-freeness are 
not satisfiable either in such cases. 

Proof. Consider a team formation problem with 6 agents, 
{A, B, C,D,E,F},k = 3, k = 3, so that two equal-size teams 
must be formed. The agents’ additive separable preferences are en¬ 
coded in Table 13 



A 

B 

C 

D 

E 

F 

A 

X 

1 

0 

2 

3 

2 

B 

2 

X 
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0 

3 

2 

C 

2 

2 

X 

1 

0 

3 

D 

1 

2 

2 

X 

2 

1 

E 

0 

2 

3 

1 

X 

2 

F 

2 

1 

2 

2 

1 

X 


Table 9: Each row i encodes the additive separable value for 
agent i of each other agent. 

It is easy to see that in the problem described above, each agent’s 
maximin share for team formation has value 3. Furthermore, any of 
the 10 possible partitions of the agents into two equal-size teams 
leaves some agent with utility of 2 or less. Therefore, no partition 


for this game provides every agent with a maximin share. In con¬ 
sequence, no partition for this problem is proportional or envy-free 
either. □ 

Proof of Proposition]^ 

Random serial dictatorship for team formation is strategyproof, 
meaning that it is a dominant strategy for each agent to report its 
true values for other agents, regardless of the other agents’ reports. 
Consider that each agent affects the partition returned by RSD only 
if the agent is a team captain, based on the serial order over play¬ 
ers and the choices of earlier team captains. Therefore, if an agent 
is not a team captain, the agent’s report makes no difference. If 
an agent is a team captain, the agent is assigned its most-preferred 
team based on the remaining players, team size constraints, and the 
agent’s reported preferences. Therefore, it is a dominant strategy 
for the agent to report its true preferences. 

Random serial dictatorship is ex post Pareto efficient for MDTFs 
with strict preferences over teams (i.e., without indifferences over 
teams). The first team captain is assigned its strictly most-preferred 
team, of the largest feasible team size. Thus, for non-negatively 
valued MDTFs, no other agent could swap or take agents from the 
first captain’s team without decreasing its value to the captain. The 
same argument holds, by induction, for later team captains. Thus, 
the partition returned by RSD is ex post Pareto efficient. 

Random serial dictatorship for team formation is not envy-free, 
because for some problem instances, no envy-free partition exists, 
as shown in Proposition]?] □ 

Proof of Proposition]^ 

The Harvard Business School draft for team formation is not envy- 
free, because for some problem instances, no envy-free partition 
exists, as shown in Proposition]^ 

The HBS draft for team formation is not ex post Pareto efficient. 
For a counter-example, consider the following team formation 
problem, with k = k = 3 and player order (A, B, C, D, E, E), 
such that A and B will be the team captains. The agent preferences 
are encoded in Table 1 101 



A 

B 

C 

D 

E 

F 

A 

X 

1 

8 

0 

6 

4 

B 

1 

X 

10 

0 

5 

3 

C 

0 

8 

X 

5 

4 

2 

D 

0 

8 

5 

X 

4 

2 
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8 

0 

5 

4 

X 

2 

F 

8 

0 

5 

4 

2 

X 


Table 10; Each row i encodes the additive separable value 
for agent i of each other agent. 

In this example, the following selections are made in the HBS 
draft. A selects C, B selects E, B selects E, and then A selects 
D. The resulting partition is {{ACD), (BEE)}. This partition is 
not ex post Pareto efficient, because all the agents would prefer the 
partition {{AEF), {BCD)}. 

The HBS draft for team formation is not strategyproof. Con¬ 
sider a minimal-size example of a problem instance with non¬ 
negative values, where the HBS draft is not strategyproof. Let 
N = {A, B, C, D, E, E}, fe = 3, and k = 3. The players’ ad¬ 
ditive separable values to each other are encoded in Table|l 1| 

In the problem shown in Table ]TT| agent A gets greater utility 
by misreporting its preferences as m A! for 24 serial orders over 
the players, and never receives worse utility than when reporting 
A. In each of the 24 cases where A gains by reporting A!, utility 











A 

B 

c 

D 

E 

F 

A 

X 

5.0 

4.9 

7.0 

0.2 

0.0 

A' 

X 

5.0 

6.0 

7.0 

0.2 

0.0 

B 

0.0 

X 

1.1 

1.6 

1.2 

1.3 

C 

0.0 

1.1 

X 

1.6 

1.2 

1.3 

D 

0.0 

1.1 

1.6 

X 

1.2 

1.3 

E 

0.0 

1.1 

1.6 

1.2 

X 

1.3 

F 

0.0 

1.1 

1.6 

1.2 

1.3 

X 


Table 11: Each row i encodes the additive separable value 
for agent i of each other agent. Row A! shows the false re¬ 
port of agent A. 

improves from 5.2 to 9.9, as the received team shifts from {ABE) 
to {ABC). This occurs when the first player to choose is A and 
the second is D. Therefore, the HBS draft is not strategyproof, be¬ 
cause in the example shown, player A gets better expected utility 
by misreporting its preferences. □ 

Proof of Proposition]^ 

Consider a minimal-size example of a team formation problem 
with non-negative values, where the OPOP draft is not strate¬ 
gyproof. Let N = {A, B, C, D, E, E}, fe = 3, and fc = 3. The 
players’ additive separable values to each other are encoded in Ta¬ 
ble [12] 



A 

B 

c 

D 

E 

F 

A 

X 

5.0 

4.9 

7.0 

0.2 

0.0 

A' 

X 

5.0 

6.0 

7.0 

0.2 

0.0 

B 

0.0 

X 

1.1 

1.6 

1.2 

1.3 

C 

0.0 

1.1 

X 

1.6 

1.2 

1.3 

D 

0.0 

1.1 

1.6 

X 

1.2 

1.3 

E 

0.0 

1.1 

1.6 

1.2 

X 

1.3 

F 

0.0 

1.1 

1.6 

1.2 

1.3 

X 


Table 12: Each row i encodes the additive separable value 
for agent i of each other agent. Row A! shows the false re¬ 
port of agent A. 

The OPOP draft is a randomized mechanism, which works by 
uniformly randomly choosing a permutation of the players, then 
acting deterministically based on that order. Strategyproofness for 
a randomized mechanism means that a player’s expected payoff is 
maximized by truthful reporting, regardless of other players’ ac¬ 
tions. To show that the OPOP draft is not strategyproof, it suffices 
to show that player A gains in expected payoff, based on the true 
preferences in row A of the table, by misreporting its preferences 
as in row A! in the table, if other players report their preferences as 
displayed in the table. 

The example problem has 6 players, so there are 6! = 720 per¬ 
mutations, and 720 equally likely outcomes from the OPOP draft, 
some of which are identical in their induced partitions. It turns out 
that the payoff for player A is better when reporting Al instead 
of A for 18 of 720 permutations, and worse in 6 others. Specifi¬ 
cally, player A performs better by lying if and only if the order of 
the first three players to choose is {ADB), {ADE), or {ADE). 
Player A does worse by reporting A^ if and only if the order of the 
first three players is {ADC). After factoring in the value of the re¬ 
sulting partitions to player A, it results that the expected gain from 
defecting to A' is 0.08 utils per game. Thus, the OPOP draft is not 
strategyproof. 


The One-Player-One-Pick draft for MDTFs is not envy-free, be¬ 
cause for some team formation problem instances, no envy-free 
partition exists, as shown in Proposition]^ 

The OPOP draft for MDTFs is not ex post Pareto effi¬ 
cient, as shown in the following minimal example. Let N = 
{A, B, C, D, E, F}, k = 3, and k = 3. Let the serial order of 
the players be alphabetical, so A selects first, and so on. The play¬ 
ers’ additive separable values to each other are encoded in Table 

M 



A 

B 

c 

E> 

E 

E 

A 

X 

2 

10 

9 

6 

0 

B 

0 

X 

10 

9 

2 

6 

C 

0 

10 

X 

2 

6 

9 

D 

10 

0 

2 

X 

6 

9 

E 

10 

0 

2 

6 

X 

9 

F 

0 

10 

2 

6 

9 

X 


Table 13: Each row i encodes the additive separable value 
for agent i of each other agent. 

In the example problem in Table [T3| the following selections 
are made, in order. Team captain A selects C, team captain B se¬ 
lects D, C (already teamed with A) selects F, and as a result D 
(already teamed with B) must select E. This produces the parti¬ 
tion {{ACE), {BDE)}. Every player would receive greater util¬ 
ity from the alternative partition {{ADE), {BCE)}. Therefore, 
the OPOP draft is not ex post Pareto efficient. □ 

Details of A-CEEI-TF 

Each agent reports to the mechanism the additive separable utility 
it gains from each other agent. The mechanism uses these reports 
to derive which affordable team each agent demands, given each 
agent’s budget and price. The total utility for an agent i of a team 
of other agents of size in [fc — 1, fc — 1], is the sum of the utilities 
for agent i of the agents in that team. The clearinghouse will pro¬ 
visionally assign agent i the team of highest utility for i that it can 
afford, or the empty team if it cannot afford any team of legal size. 

Just as in combinatorial matching, we cannot in general hope 
for an exact market clearing solution (and, consequently, an exact 
CEEI) in our setting: 

Proposition 8. There exist team formation settings where no price 
and budget vectors {p, b) exist that induce exact market clearing. 

Proof. We define exact market clearing for team formation to 
mean that when each agent is allocated its favorite team that is 
affordable: 

• Each agent is assigned to a team with size in [k, k] , including 
itself in the team size. 

• Eor any two distinct agents i and j, i demands j if and only if j 
demands i. 

This result marks a distinction from the course allocation prob¬ 
lem (also known as combinatorial assignment), where with suffi¬ 
ciently unequal budgets, some price vector must exist that induces 
exact market clearing jBudish 2011] . 

Example. Consider a team formation problem with 4 agents, 
{A, B, C, D}, with k = 2 and k = 2. The agents’ additive sepa¬ 
rable preferences are encoded in Table|14| 

In the example in Table JT^ only two resulting partitions 
must be considered, due to symmetry: {{A, B}, {C, D}} and 
{{A,C},{B,D}}. 











A 

B 

c 

D 

A 

X 

2 

1 

0 

B 

0 

X 

2 

1 

C 

1 

0 

X 

2 

D 

2 

1 

0 

X 


Table 14: Each row i encodes the additive separable value 
for agent i of each other agent. 

In order for the market to clear as {{A, B}, {C, D}}, each agent 
must be able to afford its assigned partner, and each agent must 
not be able to afford any other agent that it prefers. If we assume 
the market clears in this way, the resulting inferences produce an 
impossible conclusion: 

Pa < bs < Pc < bo < Pa 

A similar conclusion can be drawn from the other partition, 
{{A,C}, {B, D}}. We find that no {b,p) induces exact market 
clearing for this problem, no matter how unequal the agents’ bud¬ 
gets in b are. □ 

Define the price update error as 

z{p)j = (1 + e - {€/b)pj)Dj - Uj. 

Since exact market clearing is not feasible, we make use of a relax¬ 
ation. We define a notion of relaxed market clearing, which we will 
use in our search for approximately market-clearing prices. This 
form of market clearing allows a player on the market to be under¬ 
demanded if and only if its price is 0. We adapt this relaxed form 
of market clearing from Budish ( [Budish 201 1[ >. In relaxed market 
clearing: 

• Each agent is assigned to a team with size in [k, k] , including 
itself in the coalition size, unless the agent has price 0, in which 
case it may alternatively be assigned the team containing only 
itself. 

• For any two distinct agents i and j, i demands j if and only if j 
demands i. 

Proposition 9. There exists a point p* in price space V, for which 
a convex combination of the error terms z{p') induced by some 
set of prices near p* equals the zero vector, as in relaxed market 
clearing. 

Proof. The proposition can be proved through a straightforward 
adaptation of the proof of Theorem 1 in Budish ( [Budish 201 l[ l. 
First we note that any fixed point of /tf (•) induces relaxed market 
clearing. Furthermore, the price space V is compact, convex, and 
closed under fTF{-). We introduce a function F{p) that takes the 
convex hull of all values fTF{p'), for p' approaching in the limit 
but not equal to p. 

Lemma 2.4 of Cromme and Diener is useful here ( jCromme andj 
[Diener 1991| >. Take any space X that is a non-empty, compact, con¬ 
vex subset of and any function frF : X ^ X, such that X is 
closed under /t p . Define Hf : — >■ S', S' C ff, to be the function 
that takes any x 'm X and yields the set: 

{y G X\3 a sequence xt ^ x,Xi x : fxFixi) -A y} 

Lemma 2.4 of Cromme and Diener states that for any such func¬ 
tion fxF, there exists a point x* € X such that x* is in the convex 
hullofB/(x*). 


We can apply Cromme and Diener’s result to F{-), because the 
auxiliary price space V is non-empty, compact, convex, and closed 
under fTF^), and F{-) is defined as the convex hull of Cromme 
and Diener’s Hf. Therefore, as shown in Cromme and Diener, 
Kakutani’s fixed point theorem implies that F{-) has a fixed point 
p* G V. Thus, a convex combination of the error terms z{p') near 
t{p*) yields relaxed market clearing. □ 

The fixed point result in Proposition implies that for prices 
p' in the neighborhood of the truncation m a fixed point p* of the 
price update function, induced market clearing error will be low. 
Therefore, we will likely be able to find price vectors that yield 
low market clearing error by iteratively applying /tf(-) to its own 
output, until an apparent fixed point is approached. 

Proof of Proposition]^ 

First, we show that auxiliary price space V is closed under fTF{-). 
The range of z{p)j for p G V is [—1, {\N'\ — 1)(1 + e)]. Thus, 
assuming e is small, \z(p)j /\N'\\ < 1. Therefore, it is easy to see 
that auxiliary price space [—1,1 -|- 6]^^ ^ is closed under /tf(-)- 

Next, we show that if p € E is a fixed point of /tf(-)> then 
p — t{p) must induce relaxed market clearing. 

Each agent price pj in p = t{p) must either equal 0, equal b, or 
be in (0, b)^ 

If Pj = b, then no agent can afford to demand agent j, so Uj = 1 
and Dj = 0, implying that z{p)j is negative, so Pj cannot be a 
fixed point. If pj has been truncated by tf), this would reduce it, 
and zf) further reduces it. 

If Pj = 0, then if there is Dj > 0 and Uj — 0, the net effect 
of z(p)j is to increase pj, and if pj has been truncated, then that 
would have increased pj also; so with or without truncation, pj 
would not be a fixed point if it had positive D but not U. If Pj = 0 
and Dj = 0, then under-demand for j is allowed because the price 
of j is zero, so this does not violate relaxed market clearing. Dj and 
Uj cannot both be positive, because if there is incoming demand, 
there is no under-demand by definition. 

If Pj G (0, b), and pj is part of a fixed point p, then Uj — 0 
and Dj = 0, because by definition, they cannot both be positive at 
once. □ 

Proof of Proposition]^ 

First, we sketch a proof of strategyproofness-in-the-large. 

Any individual agent in the continuum economy, being zero- 
measure, has no influence on the approximate equilibrium price 
vector arrived at by update function fTFf), at any iteration of the 
A-CEEI-TF mechanism. Therefore, the only effect the agent can 
have on the outcome is that, if the agent is randomly selected to 
choose its favorite affordable team of free agents that leaves a feasi¬ 
ble subproblem, the agent’s reported preferences determine which 
team the agent is assigned. Thus, it is a dominant strategy for the 
agent to report its true preferences, so that in this case the agent 
will be assigned its most-preferred allowable team. □ 

Now we deal with approximate envy-freeness. 

Consider a team formation problem where A-CEEI-TF yields 
exact market clearing with the same partition at each stage. If all 
agent budgets were equal, then the result would be envy-free. To 
see this, consider that each agent’s team assigned by A-CEEI-TF 
in such cases is the same as the team induced by prices in the ini¬ 
tial round of price search. In this initial round, every agent is able 
to afford the team of every other agent, because all agent budgets 
are equal, and all agents’ demands are consistent (i.e., simultane¬ 
ously satisfiable), because there is exact market clearing. Because 
the final partition returned is the same as that induced by the initial 
prices, no agent must envy any other in the final partition. 












More generally, consider the same setting but without equal bud¬ 
gets. That is, consider a team formation problem with non-negative 
values and team size constraints, where A-CEEI-TF yields exact 
market clearing with the same partition induced at each stage. Re¬ 
call that all agents’ budgets are in the range [1, 6}, where b = 
1-|-(1/| A|). Because 1 < fc < \N\, this implies that b < l-|-(l/fc). 

No agent i can envy another agent j in its own team, because 
j’s team (excluding the costs of i and j) costs weakly less than 
the team of i (excluding the cost of i), as all agent prices are non¬ 
negative, so i could afford the team of other agents that j receives 
(the team of agent i, excluding agents i and j). 

Assume for a contradiction that i has envy for agent j that is not 
bounded by a single teammate. Note that i and j must be assigned 
to distinct teams. Let k' be the number of other agents in j’s team, 
k' < k. Let ji be the first other agent in j’s team, and so on through 
jk* • 

Let p be the exact market clearing prices that are found in the 
first stage of the A-CEEI-TF mechanism. Let x be the partition 
that is induced by the mechanism, which we have assumed to be 
consistent across all stages of the price search. Xi is then a vector 
in {0,1}'^' that has a 1 for each agent on the same team as agent 
i. For convenience in calculating the cost of the team for agent i, 
we let Xii = 0 for all i, because an agent is not required to pay the 
cost of teaming with itself. Let {xi \ {y}) be a vector equivalent to 
Xi, but with Xij set to 0. 

P ■ (*i \ {jV}) > bi •. Wr £ {1,, k'}. This is because if 
agent i’s envy for agent j is not bounded by a single teammate, 
agent i must not be able to afford any subset of j’s team created by 
removing one agent (and excluding agent j). 

(k' — l)p ■ Xj > k'bi. This comes from adding the L.H.S. and 
R.H.S. of the k' inequalities above, where the subtracted teammates 
from Xj cancel to produce one fewer complete Xj. 

bj > p ■ Xj, because agent j can afford its own team. 

(fc' — l)bj > k'bi. Replacing p ■ Xj with bj in the inequality 
above. 

bj/bi > k'/_{k' -1). 

Recall that b < l-|-(l/fc), and fc > 1, meaning that b < 1 -f 
(l/(fc — 1)). This implies that k/{k — 1) >b. 

k > k', which implies kk' — k < kk' — k' . So k{k' — 1) < 
k'{k — 1). This means k/{k — 1) < k'/{k' — 1). 

bj/h > k'/{k' - 1) > k/{k - 1) > 6. 

bj/bi > b. 

This contradicts the claim that all budgets are G [1, b]. There¬ 
fore, envy must be bounded by a single teammate in a team forma¬ 
tion problem with non-negative values and team size constraints, 
where A-CEEI-TF yields exact market clearing with the same par¬ 
tition at each stage. □ 

A-CEEI-TF Implementation 

Unlike the alternative mechanisms, A-CEEI-TF is far from 
straightforward to implement. Below we describe an implementa¬ 
tion which uses ideas from |Othman, Sandholm, and Budish] ( |2010| l. 

To run the A-CEEI-TF mechanism, we used a modified form 
of the procedure described by Othman, Sandholm, and Budish, 
which alternates tabu search and mixed-integer program (MIP) 
solving until an approximate fixed point in price space is found 
(Othman, Sandholm, and Budish 20T()| . The mechanism starts by 
randomly assigning approximately equal budgets to the agents 
G (100,100 -I- 100/|A|), and setting prices for all agents to be 
equal to the minimum agent budget divided by the maximum num¬ 
ber of agents that can be legally demanded, {k — 1). Based on the 
random order of players, the next unselected player in order is al¬ 
lowed to select its favorite affordable team of unselected players, 
of legal size and that leaves a feasible subproblem. Before a player 


can make this selection, the tabu search-MIP hybrid algorithm is 
run to assign the prices to remaining agents. 

Each time the price search procedure is called, 20 iterations of 
tabu search are run. For a given iteration, the current price vector 
is “expanded” to yield up to 5|A| + 5 new price vectors, each of 
which is evaluated in terms of its relaxed market clearing error. 

The new price vectors are generated as either unilateral devia¬ 
tions from the current price vector, or moves in the direction of 
the “gradient,” or z{-). The gradient neighbors of p are gener¬ 
ated by moving in the direction of z(p), for each of the step sizes 
(10.0, 5.0,1.0, 0.5,0.1), where the step size is just the L 2 -norm of 
the change in p. For each agent j, we generate unilateral neighbors 
of p by setting the agent’s price to 0 if z{p)j < 0, otherwise by 
increasing the agent’s price by each of (1.0, 0.5, 0.1,0.05, 0.001). 
Any neighbor prices outside of [0, 6] are tmncated to this range. 

Each neighbor price vector of the previous price p is evaluated 
for its market clearing error, using an MIP solver to determine 
which team each agent would demand at those prices. We used 
CPLEX vl2.5, with Concert bindings for Java, to solve each MIP. 
Market clearing error for a price vector p is defined as the L 2 -norm 
of z{p). 

A “step” of the tabu search entails moving from the previous 
price vector to the price vector of lowest error among the neigh¬ 
bors, that is not in the tabu list (a list of the previously visited price 
vectors). After 20 iterations, the visited price vector that produced 
the lowest (best) market clearing error is returned. 

To find which team an agent j will demand, given a set of legal 
team sizes that leave feasible subproblems, we first break the legal 
team sizes into groups of consecutive integers. For example, if fc = 
2,k — 4, and there are 4 agents remaining including the “self” 
agent, it is not legal for the agent to demand 2 other agents for a 
team of size 3, even though 3 G [2,4], because this does not leave 
a feasible subproblem. So in this case we would first break up the 
problem into two distinct MIPs, one with fc = fc = 2, the other 
with fc = fc = 4. The agent’s preferred team from the union of the 
MIP result teams would then be returned as the agent’s demand. 

Now we have a price vector p over the M other remaining agents 
and a consecutive sequence of integers that are legal team sizes, 
{k, k + l,...,k}. The demand of agent j with budget bj is: 

|M| 

max : E CiUj{i) 

i=l 

subject to: 

c G {0, 

|M| 

ciPi < bj 

i=l 

[Ml 

k — 1 < a < k — 1 

i=l 






